Inadequacy of Taylor series for perturbation expansion: 
a lesson from weak measurement 

Antonio Di Lorenzo* 
Universidade Federal de Uberldndia, Uberldndia, MG, Brazil 

Abstract 

We all learnt from calculus textbooks that a Taylor expansion should be made 
consistently: all first-order terms should be grouped together, then all the second- 
order terms, etc. However, when applying perturbation theory to estimate proba- 
bilities, a common task in quantum mechanics, this automatic procedure can lead to 
nonpositive-definite probabilities. (Here, we are talking about probabilities that must 
be positive, as they can be inferred directly from the frequency of observed events, 
and not about intermediate functions, as Wigner quasiprobabilities, the inference of 
which from experimental data is a nontrivial task.) We demonstrate how to preserve 
the nonnegativity of probabilities at the cost of getting a bad grade in Calculus, and we 
show how the corrected expansion leads to a modification of the commonly accepted 
expressions for weak measurements, curing unphysical divergences at the same time. 
We provide the corrected formulas in the trivial case of an instantaneous interaction, 
the most commonly studied, and in the case of a finite-duration interaction during 
which the measured observable is not conserved. In the linear regime, the response of 
the detector is shown to be given by a generalized Kubo formula for a non-Hermitian 
perturbation. 
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WHAT IS A MEASUREMENT? 



Some may have a very restricted idea of a measurement, limiting the concept to the case 
when a single observation on a system allows to infer total information about a dynamical 
quantity of interest in a second system that interacted with the first in a controlled way. 
For us, instead, a measurement is any act of inference about a system done by observing 
another system (which can be called indifferently the detector, the meter, the probe, the 
apparatus, or the ancilla) that has interacted with the measured system, even though this 
inference has some uncertainty connected to it. 

From a strict philosophical view, we never observe anything but our sensations or per- 
ceptions, i.e. the consciousness of sensations, that are given to us in an immediate way, 
according to Kant, 1 who stated: "Things in space and time are given only in so far as they 
are perceptions (that is, representations accompanied by sensation)-therefore only through 
empirical representation". Thus, in principle, we should follow the observed system, and 
then the measuring apparatus that interacts with it, hence the visible electromagnetic fields 
that interact with the former, and then our retinas that interact with the e.m. field, etc., 
in a chain with a blurred end due to ill-defined terms such as "consciousness" and "sensa- 
tion". As von Neumann noted, 2 in quantum mechanics there is no need to go up this chain. 
Sometimes, it is sufficient to apply the rules of quantum mechanics directly to the system. 
This turns out to be the case of a projective measurement. Most of the other times, it is 
necessary to go up just one step, by describing both the system and the detecting apparatus. 
Consistency requires that the former case is a limiting case of the latter. It is important to 
remark that in this second case we can not make definite inferences about the system, but we 
have to leave some room for uncertainties. These uncertainties may be classical, in the sense 
that we do not know the exact state of the detector, that the readout has always some noise, 
etc. But they can also be quantum, if the detector is prepared in a superposition. Observing 
the consequences of the quantum nature of the detector, however, is not immediate. 

In a seminal work, 3 Aharonov, Albert, and Vaidman showed that by having a weak inter- 
action between system and apparatus, the average output of the latter could be much larger 
than the maximum eigenvalue of the observed quantity (times the amplification factor). 
Soon, it was realized ' that the quantum nature of the detector was at the basis of this phe- 
nomenon, as constructive interference in the probability tails for the readout led to the large 
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average values of the output. This does not always happen, however: the observed system 
must subsequently undergo a second measurement, on the output of which the result of the 
first one is conditioned. This procedure is known as postselection. When the postselected 
state is almost orthogonal to the initial state of the system, the anomalous average output is 
observed. However, as the initial state of the system is only slightly perturbed by the weak 
interaction, the probability of a successful postselection is very small in this regime, so that 
one has to be patient and collect a huge amount of data, before getting his reward. 

The (rare) amplification of the output certainly contributed to popularize the subject of 
weak measurement, but whether it lead to a net gain in signal-to-noise or in the amplification 
is somewhat controversial. ' 6 The worth of postselected weak measurements, instead, could 
reside in its allowing sequential or joint non-projective measurements, 7-11 in its providing 
new techniques to perform quantum state tomography, 12 " 15 or in its possibly leading to the 
disembodiment of dynamical properties from their physical vehicles. 16-19 

METHODS 

The weak measurement formalism is but perturbation theory applied to the composite 
system formed by the measured system and by a second quantum system, the detector. To 
avoid confusion, we shall say "total system" when referring to both the detector and the 
measured system, and "system" when referring to this latter. The perturbation is done in the 
interaction between the two subsystems, while the evolution due to the free Hamiltonians is 
considered to be easily computable. Furthermore, contrary to textbook perturbation theory, 
the formalism is applied to conditional probabilities, after a postselection on the system is 
done. As observable probabilities are nonnegative-definite, particular care should be taken 
while making the perturbative expansion, in order to preserve this essential property. In 
particular, as we show below, the common knowledge that all terms of a given order are to 
be grouped together does not apply. 

A simple case: instantaneous interaction. 

For simplicity, we make the following hypotheses: 

1. the interaction between system and detector is a (von Neumann) instantaneous inter- 
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action; 



2. the probability is calculated immediately after the interaction; 

3. the preparation of the system and the probe p« <g) po is known immediately before the 
interaction; 

4. the postselection in the state p/ <8> 1 is made immediately after the interaction. 

The extension to an arbitrary case is obvious, except perhaps for point 1. 20 > 21 

The joint probability of postselecting the total system in pf <g> \R)(R\ (we assume that 
the measurement on the detector is sharp, so that Born's rule applies) is then 

V(R,p f ) = Tr [(p f ® \R){R\)U{ Pi ® Po )U% (1) 

with the time-evolution U = exp[i\AX]. We kept the interaction in the von Neumann 
protocol, with A observable of the system and X observable of the meter and A a coupling 
constant. However, notice that, contrary to the von Neumann protocol, we are not assuming 
that the readout variable R is conjugated to X, nor that the meter is initially in a sharp 
state of the readout p ~ \R = 0)(R = 0|. 

The probability of postselecting the system in pf is 

v( Pf ) = j2nR,Pf)- (2) 

R 

Let us apply perturbation theory , to Eqs. (1) and (2), including up to first order terms in 
the propagator 

V(R,p f ) ~Tr{(p/® \R)(R\) [l + iAAx] (p;®p ) [l - iAAx] } , (3) 
V(p f ) ~ Tr | ( Pf ® 1) [l + i\AX\ ( Pi (8) po) [l - i\AX\ } . (4) 

The textbook calculus approach would be, e.g., to retain the first order terms, 

V 1 (R,p f )=Tr s [pfP l ](R\po\R) + {i\Trs[pfA Pi )(R\Xp \R) + c.c.} , (5) 
Vt(pf) =Tr s [p fPl } + {i\Trs\pfApi]Tr D [Xpo]+c.c.} . (6) 

However, the above expressions do not preserve the positivity of the probability, since they 
are not of the form V = Tr [EUiFJj\\ with E, F positive operators and U\ an arbitrary 



operator. True, the neglected terms are C(A 2 ), but nevertheless the probability could turn 
negative for some values of pf and R if to lowest order P < 1. This can occur if A is 
somewhat largish. The correct way to make the expansion is to keep the product of the first 
order terms in IA and W , giving 

V(R, Pf ) ~Tr s [p f p i ](R\p \R} + X^ s [p f Ap i ](R\Xp \R) + c.c^ 

+ X 2 TT S [p f A Pl A](R\Xp X\R), (7) 
V{p f ) ^Tr 5 [p /P ,] + [i\ Ti s [p f A Pl ] Ti D [Xp ] +c.c.} + A 2 Ti s [p f A Pl A]TT D [X 2 p ]. (8) 

Thus, up to "first order", the system enters the probabilities with three terms: the over- 
lap O = Tr s [pfPi], the complex number A w = Tr s [pfApi}/0 and the real number B w = 
TYs[pfApiA]/0. If one considers, as usually done in the context of weak measurement, the 
conditional probability Q(R) = V(R, Pf)/V(pf) and its related averages, the overlap O can 
be simplified between numerator and denominator. While mathematician will shudder in 
disgust, O may as well be 0, and the formulas still be valid, in the sense that in this limit 
B w is overwhelmingly large compared to A w , in both the numerator and the denominator. 
Another point in favor of this improved expansion is that while OA w tends to for O — > 0, 
the product OB w stays finite, excluding some trivial cases for which the probability of post- 
selection is exactly null. For this reason, the expansion is robust for any preparation and 
postselection of the system. 

But what does a naive application of Taylor series, as learnt from Calculus, pre- 
scribes? Since we are including a second order term, according to the prescription, for 
consistency we should expand the propagator up to second-order, and retain terms like 
A 2 Tr (p/ ® \R){R\)A 2 X 2 (pi <g> p ). We did this in a previous paper, 22 where we stated er- 
roneously that the reason to neglect these terms, which give rise to a complex number 
C w = Tis[pfA 2 pi]/0, was that the second order correction becomes relevant only in the 
regime C w <C B w . From the discussion above, instead, it can be seen that dropping of 
C w is justified by the positive-definiteness of the probability. If we wanted to retain terms 
X 2 A 2 X 2 in the propagator U, we should retain some A 3 and A 4 terms in the probability. 
Paradoxically, as shown in Fig. 1, including the second order terms in the Taylor series 
according to the textbook prescription may make the probability more negative than the 
first order approximation. 
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Finally, the conditional average of an arbitrary observable R is obtained by integration, 

(R) „ (R)o + i\A' w ([R,X]) - XAl({R,X}) + X 2 B W (XRX) 
l-2XAl(X) + X 2 B w (X 2 ) 

where (M) = Tr D [Mp ], [M, N] = MN - NM, and {M, N} = MN + NM, A' w = Re(A w ) 
and A" w = lm(A w ). Notice that we did not assume anything about the spectrum of R: it 
may be continuous, discrete, or mixed. 

In particular, in the limit XA'^(X) <C 1, \ 2 B W {X 2 ) <C 1, it may be possible to retain the 
first order terms, provided that B w is not too large, i.e., provided the overlap between the 
preparation and the postselection is not too small. Then the result of Ref. 23 is recovered 

(R) ~ (R) + iXA' w ([R,X}) - XA'i [({R,X}) -2(R) (X)o\ . (10) 

Remark the appearance of the symmetrized correlator. In any case, there is no a priori 
reason to prefer a polynomial in A to the rational function of Eq. (9). In particular, it 
may seem that making a first order approximation, while inadequate for a probability, that 
is nonnegative, could be appropriate for the average value. However, let us consider the 
following extreme example: the detector is a two- level system, whose output can be R = —1 
or R = +1. The average value is a simple function of the probability (R) = 2P(R = 1) — 1. 
If the probability takes a negative value, (R) becomes smaller than -1, which is impossible. 



General case: Interaction with a finite duration. 

Now we lift the hypothesis 1 made in the previous subsection, i.e. we allow the interaction 
to last a finite time. Contrary to previous works, 20,21 we shall not assume that A or R is 
conserved. This generalization is particularly interested in prospective solid state realizations 
of weak measurement. 24-30 The interaction is shaped by a function g(t), satisfying J dtg(t) = 
1 and vanishing outside a time window [0, r\. For convenience, we factor out the interaction 
strength A, so that 

H mt = -h\g(t)Ax. (11) 
Hence, the time-evolution operator is 

U 



W r|exp iX^ dtg{t)A{t)X{t) j, (12) 
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FIG. 1. Comparison between Taylor series and the controlled approximation. The 

plot shows the probability for a measurement of the z-component of a spin 1/2. The solid line 
represents the exact result, the dotted line the controlled approximation given by the ratio of 
Eq. (7) and Eq. (8), the dot-dashed line the first-order Taylor series, and the long-dashed line the 
second-order Taylor series. The spin is prepared in a pure state making an angle ir/3 with the 
z-axis, measured weakly, then postselected in a state making an angle ir/2 with the z-axis and 
coplanar with z and the preparation polarization. The probe was initially prepared in the pure 
state ip(R) = cos (4R/A) exp [— R 2 /4A 2 ]. The interaction between the probe and the spin was 
H int = -H6(t)\a z X, with [X, R) = i. We took A = 0.2A. 

with T time-ordering, = 14^ <8> free evolution of system and detector, while 
A(t) = Uf^AUt X(t) = u[ D ^ XU^ are the operators in the interaction representation. 
As in the previous subsection, we make a controlled expansion to "first order" of the joint 
probability: 



V{R, Pf ) -Trjwft ( pf ®\R}(R\)uM 

x (pi ® Po) 



1 + iX / dtg(t)A(t)X(t) 



1-iX dtg(t)A(t)X(t) 



Tr s [p f (T)pM(R(T)\p \R(T)} + \ 



i / dtg(t)A w (t)(R(r)\X(t)p \R(r)) + c.c 



+ \ 2 I I dtdt'g(t)g(t')B w (t,t')(R(r)\X(t)poX(t')\R(T)) \. (13) 



We put \R(t)){R(t)\ = U D)j \R}(R\U D) and p f (r) = U S)j p f U S) the state \R)(R\ and p f , 
respectively, propagated backwards in time to the beginning of the interaction, and defined 
the time-dependent weak values 

K(t) J*&£¥®£ , (14) 

^s\Pf{r)pi\ 

_ Tr s [p f (r)A(t) Pi A(t')] ( . 

Trs[pf(r)pi\ 

Consequently, the probability of postselection is 

V( Pf ) -TtsMt)*]^ -2\£ dtg(t)(X(t)) K(t) 

+ A 2 £ J* dtdt'g{t)g{t') (X(t')X(t)) B w (t, *') |. (16) 

The conditional average is then 

(R) ~ |l - 2\£dtg(t){X(t)) A'^t) + X'JJ^dtdt'g^git'^X^Xit^A^t') 
x (<£(t)> + A fdt^*) f^(t)(^[i?(r),X(t)]) -<(t)({ J R(r),X(t)}} 



+ A 2 ^^W^(0^(^0(^(i')^)^(0>o}- (17) 
In the linear regime, we have the modified Kubo formula 

(R)~(R(r)) + i f dt(W(t)R(r)-R(r)W\t)) (18) 
Jo 

with the non-Hermitian operator 

W{t) = \g(t)A w (t)[X(t) - (X(t)) }. (19) 

Notice that when no postselection is made, pf = 1, the weak value A w {t) = (A(t))i is real, 
and the ordinary Kubo formula is recovered. 

CONCLUSIONS. 

We have derived a general formula for the weak measurement, providing positive-definite 
probabilities by construction, and giving an interpolation to the average output of an arbi- 
trary variable for arbitrary preparation and postselection. 
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